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Abstract. We consider bounds on the convergence of Ritz values from a sequence of Krylov 
subspaces to interior eigenvalues of Hermitean matrices. These bounds are useful in regions of low 
spectral density, for example near voids in the spectrum, as is required in many applications. Our 
bounds are obtained by considering the usual Kaniel-Paige-Saad formalism applied to the shifted 
and squared matrix. 
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1. Introduction. In this paper we derive a bound on how well an eigenvalue of 
a Hermitean matrix is approximated by the corresponding Ritz value obtained from 
its restriction to a Krylov subspace. Such bounds are useful in practice because no 
Krylov space based method for determining eigenpairs (such as Arnoldi or Lanczos 
algorithms) can do better than this even in exact arithmetic. 

We shall show that the eigenvalues A Q of an N x N Hermitean matrix A are 
approximated by its Ritz values 9 a from the Krylov space /C n (A, v) with an error 
bounded by 

2(A' - \'„)(KL tanZ') 2 e- 2 ("- 2 ^ +c, K' 2 ) 
a |A« — Aj 

where Aj > Af, > ■ ■ ■ > A^y are the eigenvalues of A' = —(A — A) 2 with j such that 

a;- = -(a-a q ) 2 , 

tsI _ TT A '' ~ A W „/ _ / A J ~ A j+1 

7=1 A * A 3 V V 1 N 

and Z.'j is the angle between the projection of v onto span(z J+ i , . . . , z^v) and Zj where 
Zi is an eigenvector of A' (and thus also of A) belonging to the eigenvalue X'j. 

In section f}2] we briefly indicate why the problem of finding approximations to 
interior eigenvalues of large Hermitean matrices is important in a variety of physical 
applications. In f}3]we review some basic properties of Hermitean matrices introducing 
the notation to be used throughout the paper, and in 2] we review some useful 
bounds on Ritz values for general subspaces. Section Q 5. II derives the bounds on the 
convergence of extremal eigenvalues due to Kaniel [2], Paige [4] and Saad [7]. We 
then present a straightforward but apparently new generalization that gives bounds 
for the interior eigenvalues close to "voids" in the spectrum in ^5 . 21 followed by some 
examples illustrating the application of our bounds. 



2. Applications. The problem of finding good approximations to interior eigen- 
values in low-density regions of the spectrum of a Hermitean matrix is common to 
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many areas of computational science, but the particular application that led us to 
consider this problem [5] is that of evaluating the Neuberger operator for lattice QCD 
(Quantum Chromodynamics being the quantum field theory of the strong nuclear 
force). This requires us to evaluate the signum function of the "Hermitean Dirac op- 
erator" 75-D, which is defined by diagonalizing this matrix and taking the sign (±1) 
of each of its eigenvalues. It is far too expensive to carry out the full diagonalization, 
so we use a Zolotarev rational approximation for the signum function as this can be 
evaluated just using matrix addition, multiplication, and inversion (using a multi-shift 
solver for its stable partial fraction expansion). This approximation is expensive for 
very small eigenvalues of 75-D, and as there are only a relatively small number of these 
we want to deflate them and take their sign explicitly. While the Neuberger operator 
is Hermitean, it is not positive; furthermore, for physically interesting parameters its 
spectrum has a large void around zero, with perhaps only a few "topological" or "de- 
fect" modes leading to discrete eigenvalues within this void. The relevant numerical 
problem is therefore to find the eigenpairs within or close to this void. 

Previously [Tj such eigenpairs had been found by finding the extremal eigenpairs 
of the square of the Neuberger operator, but this then required further effort to 
separate artificial degeneracies of the squared matrix arising from (almost) degenerate 
positive and negative eigenvalues of the original matrix. We observed that there is in 
fact no need to square the matrix, as the desired eigenpairs converge rapidly for the 
Neuberger operator itself, despite the fact that the eigenvalues are far from either end 
of the spectrum and thus the usual Kaniel-Paige-Saad bounds are not applicable. We 
used a new algorithm based on the Lanczos algorithm with selective orthogonalization 
(LANSO) method of Parlett and Scott [6] (which avoids the appearance of duplicates 
of eigenvalues due to rounding errors in finite-precision arithmetic computations) that 
allows us to find eigenpairs with eigenvalues of a Hermitean matrix A that lie within 
a specified interval. We shall describe this algorithm in a forthcoming publication. 

The fact that such interior eigenpairs converged rapidly led us to consider how 
to find realistic bounds in this case, and we were led to use the Kaniel-Paige-Saad 
bounds for the extremal eigenvalues of the shifted and squared matrix —(A — A) 2 to 
give bounds on the convergence of the original matrix A. 

We do not believe that applications in which such interior eigenvalues in low- 
density spectral regions are important are unusual. To give just one more example, 
we may consider the Hamiltonian for an electron in a metal, semiconductor, or in- 
sulator, which may be approximated by a large Hermitean matrix with "conduction 
bands" separated by large "band gaps" . The details of the eigenvalues deep within the 
bands are not too important: for example their separation is related to the inverse 
(macroscopic) volume of the system under consideration, but the eigenpairs at the 
edges of and between the bands determine many physically important properties. 

3. Basic Properties of Hermitean Matrices. In this section we summarize 
the basic properties of Hermitean (symmetric) matrices, with the goal of specifying 
our notation. 

A matrix A is Hermitean (with respect to a sesquilinear inner product) if A = A f , 
which means (u,Av) = (A^u, v) = (Au, v) = (v, Au)*, or equivalently • Av = 
(A^u)^ • v = (Au)t • v = (v^ • Au)*. An eigenvalue A of A satisfies Az = Az where 
z 7^ is the corresponding eigenvector. The eigenvalues are real 

A = (z,Az) = (z,A*z) = (z,Az)* = y 
(z,z) (z,z) (z,z)* 
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and the eigenvectors corresponding to different eigenvaiues are orthogonal: A(z', z) = 



(z',Az) = (Az,z')* = (z, Az')* = A'*(z,z')* = A'(z',z), hence (A' - A)(z',z) = 0, so 



A 7^ A' => (z',z) = 0. We can choose eigenvectors belonging to the same eigenvalue 
to be orthonormal, for example by using the Gram-Schmidt procedure, as any linear 
combination of such eigenvectors is also an eigenvector. 

Since any matrix can be reduced to triangular form T by a unitary (orthogo- 
nal) transformation^ (change of basis), A = UTIT 1 = UTU t , and T* = (U t AU) t 
U ' A L = U^AU = T, it follows that T is real and diagonal, and thus AU = UT so the 
columns of U furnish the orthonormal eigenvectors. 

4. Subspace Bounds. 

4.1. Rayleigh Quotient. We consider how the eigenvalues and eigenvectors of 
an approximation to a matrix A are related to the actual eigenpairs. Consider the 
Rayleigh quotient which has the spectral representation 



This is a weighted mean of eigenvalues, so it satisfies the inequalities Ai > p(u, A) > 
A at, where N — dim A and we have ordered the eigenvalues such that Ai > A 2 > • • • > 



4.2. Ritz Pairs. The extrema of the Rayleigh quotient may be found by varying 
the components ui = (z^, u) of u subject to the constraint that ||u|j = 1. Introducing 
a Lagrange multiplier p we find (A — p)u = 0, whose solutions are the eigenpairs 
p = Xj with u = Zj for j = 1, . . . , N. If we restrict u to an m-dimensional subspace 
X C R N then the extrema of the Rayleigh quotient are the Ritz pairs p = 9j with 
u = yj for j = 1, . . . , to. The Ritz pairs are the eigenvalues and eigenvectors of the 
restriction of A to the subspace, H = Q^AQ, that is Hsj = 9jSj. Here Q is an N x m 
matrix with orthonormal columns (Q^Q = 1) and QQ^ is an orthogonal projector 
onto X. Note that the vectors yj = Qsj 6 are not eigenvectors of A, although 
Q^(A — 0j)yj = 0, since the residual R = AQ — QH ^ in general. The Rayleigh 
quotient p(u € X, A) restricted to the subspace X is a weighted mean of Ritz values 
so 9\ > p(u S X, A) > 9 m where we have ordered the Ritz values 9\ > 62 > ■ ■ ■ > 9 m . 
For large matrices our strategy is to compute the Ritz pairs of A for suitably chosen 
subspaces X; so we are interested in how well these may approximate the eigenvalues 
of A itself. 

4.3. MaxMin and MinMax Bounds. Let Sj C X be an arbitrary sub- 
space of dimension j, and Cj-% C X another arbitrary subspace of codimension 
j — 1, i.e., a subspace of dimension to — j + 1 if m < 00. There must be a non- 
zero vector v e Sj fl Cj-i) where of course v depends upon Sj and Cj_\\ there- 
fore mhiugs^ p(u, A) < jo(v,A) < max ueCj _ 1 p(u, A). Hence, for any pair of sub- 
spaces Sj and Cj_i, mirx u gs, p(u, A) < maxugp.j p(u, A). Taking the maximum 
over all Sj and the minimum over all Cj-\ we obtain max^cx minugs^ p(u, A) < 
minc' J _iCX niax ue c i _ 1 p(u, A). For the particular subspace Sj = span(yi, . . . , yj) the 




1 This is "Schur normal form" , which follows from the Cayley-Hamilton theorem that every matrix 
satisfies its characteristic equation, and the fundamental theorem of algebra which states that the 
characteristic polynomial p(A) = det(A — A) has exactly dim A complex roots, counting multiplicity. 
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minimum of the Rayleigh quotient is 9j , likewise for C,-_i = span(yj , . . . , y m ) its max- 
imum is also 9j, so 9j < max Sj cx rnin ueSj p(u, A) < vamc^cx max u6Cj _i p(u, A) < 
8j-.we have thus established the MaxMin and MinMax bounds 

max min p(u, A) = = min max p(u, A). 

Sj-cxueSj Cj-iCXueCj-i 

4.4. Krylov Spaces. An interesting family of subspaces are the Krylov spaces 

AC (A, v) = span(v, Av, A 2 v, . . . , A" _1 v) 

where v ^ is some arbitrary vector. The Ritz values of A in a Krylov space are non- 
degenerate, as otherwise we could construct a Ritz vector y ^ such that (y, v) = 0: 
this would imply that (y, A fe v) = 9 k (y,v) = Vfc > contradicting the assumption 
that y £ /C„(A,v). 

A similar argument shows that for Ritz pairs 9, y and 9' , y' there is a unit vector 
u G span(y,y') with (u, v) = 0, for which (u, A fc v) oc (6 — 0')||v||, indicating that 
although we may hnd good approximations in a Krylov space for nearly-degenerate 
eigenvalues, we should not expect the same to be true for the corresponding eigenvec- 
tors. 

5. Error Bounds. 

5.1. Kaniel Paige Saad Bounds. In this section we briefly survey the deriva- 
tion of the Kaniel-Paige-Saad bounds on the extremal eigenvalues of Hermitean ma- 
trices. Our notation is similar but not identical to that of Parlett [5]. 

Let us now study how rapidly we may expect eigenvalues to converge to a given 
accuracy in the sequence of Krylov spaces of increasing dimension. We shall follow the 
method introduced by Kaniel [2] and corrected and extended by Paige [4] and Saad [7] . 
Consider the angle Z(zj, u) between an eigenvector Zj and a vector u £ AC„(A, v). We 
shall assume that the initial vector is not orthogonal to the eigenvector, (zj,v) 7^ 0: 
this would be very unlikely for a randomly chosen v. Indeed, we expect than on 
average |(zj,v)| ~ 1/\N if v is a random unit vector. 

Consider the subspace X C /C„(A, v) C R N : with X — span(yj, y J+ i, . . . , y„) 
where the yj are Ritz vectors for IC n (A — Aj,v) = /C n (A, v) for which the MaxMin 
bound gives the first and last inequalities in 

9a — A, = max min p(u, A — A, ) < max min p(u, A — A, ) = 0, 

the central inequality following from the inclusion X C R N . p(u £ X, A) < 9j as it is 
a weighted mean of 9j , 8j+i , . . . , 9 n , hence p(u £ X,A—Xj) < 9j — Xj, or equivalently 

< Xj - 9j < p{u £ X, X 3 - A). 

Any u in the Krylov space can be written asu = p(A)v where p is a polynomial 
of degp < n — 1. A sufficient condition for u = p(A)v to be in X is that the 
polynomial p has the Ritz values 9\, 92, ■ ■ ■ , 9j-i as roots, p{9{) = for i < j. If 
f>(A) = (A — 9i)q{k) with q some polynomial of degree < n — 2 then (yi,u) = 
(y i( p(A)v) = ( yi , (A - 9i)q(A)v) = ((A - Ofa, g(A)v) = (w,g(A)v) = 0, where 
w is a vector orthogonal to JC n (A, v) and hence to q(A)v £ /C n _x(A, v). Therefore 
a sufficient condition for u £ X is that p(x) — q(x)l(x) with degq < n — j and 
K x ) = Ilti e^x- ■ Although we do not need it here it is easy to prove that this is 
also a necessary condition. 
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The Rayleigh quotient for a vector u = p(A)v = ^^ =1 p(Afc)z/ c (zfc, v) is 

The terms in the numerator for k < j are all negative, so we may bound p above by 
dropping them; likewise we only increase the bound by replacing Xj — Xk by Xj — Ajy 
for the remaining terms in the numerator, and dropping all the terms with k =/= j in 
the denominator, leading to 



p(ueX,Aj-A) < (A^-Ajv)- 



, E fc =j+i b(Afc)(zfc,y)| 2 

|p(Aj)( z j» v )l 

From this we immediately obtain the desired bound using Kaniel's choice of q as the 
Chebyshev polynomial T„_j o jj, where 

7 ^ A J = 2 ^ \ 1 = 1 + 2- — 

maps the interval [Ajv,Aj+i] i— > [—1, 1], hence (q.v., Appendix [A~|l 

max |g(A)| = max |T„_j(a;)| < 1 

AG[A N ,Aj + i] x£[— 1,1] 

Furthermore /(A,,) = 1, so if we define Kj = \1(Xn)\ > maxA £ [i Ni A i+1 ] |^(A)| we obtain 

P (u e i, V A)<^M^. 

T n _ i (7 J -(A i )) 

y-^iV 1/ ^ v)| 2 

Here (tanZj) 2 = — i so is the angle between the projection of v onto 

span(z :)+ i, . . . , zn) and Zj. Note that in the interesting case where 9i « Ai for i < j 
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Xj ~ . A; A, 

From the bounds on Chebyshev polynomials given in Appendix [S] we get 

< Xj - 9j < A(Xj - X N )(Kj t&nZ 3 ) 2 e-^ n -3 ) ^ + °^ ) 

where jj(Xj) = 1 + 2rf with (ij = \J ■ and thus ln (7j(Aj) + \fl 3 ( X i) 2 ~ 
2 H + 0(fi 2 ). 

5.2. Bounds for Interior Eigenvalues. We will produce a bound for an inte- 
rior eigenvalue by introducing a shift A and considering the spectrum of the shifted 
and squared matrix A' = —(A — A) 2 . The parameter A is only needed for our error 
bound and has no effect on the Krylov space Ritz values; hence the best bound will 
just be the minimum over all possible values of A; a practical bound can be found by 
a judicious choice of A. 

Notice that the "interior" eigenvalues of A near A are the largest eigenvalues of A'. 
Furthermore, the Krylov space for A' is contained within that for A, albeit with twice 
the size (but the same number of applications of A): /C„(— (A — A) 2 , v) C /C2n(A, v). 
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We will establish that the Kaniel-Paige-Saad bounds for /C n (A',v) must therefore 
also hold for the latter. 

Suppose that the spectrum of A has an interior eigenvalue A a . As usual, we shall 
write the eigenvalues in decreasing order Ai > • ■ • > \ a -i > X a > ■ ■■ > Ajv, as we 
shall do for the corresponding eigenvalues of A', A^ > • • ■ > A^_j > \'j = — (X a — A) 2 > 
Aj +1 > X' N . The smallest eigenvalue of A' is X' N = min(-(Ai - A) 2 , -(Ajv - A) 2 ). 

We apply the Kanicl Paige Saad bound to the matrix A' in the Krylov space 
£„(A', v) whose Ritz pairs are 9j,y'j for j = 1, . . . , n. Following the arguments of the 
preceding section we find that the Ritz value 9j satisfies A'- > 9'j > p(u G A', A') for 
any vector u S X' = span(y^-,y^ +1 , . . . ,y' n ) and the Rayleigh quotient is bounded by 
choosing u = p'(A')v for the polynomial 

This leads to < A^- - 9'- < p(u € A', X'j - A') with 

(5.1) p(u e X', Xj - A') < 4(A;- - A' A r)(A';-tanZ;-) 2 e _4( "^ ) ^ +0( ^ 2) = e' A , 
where 



jy-i — tt ®i ~ n ~ TT * ~ N ,,' - / o J+ 1 
3 — LL a* y ~ 11 \' \' ' ' i ~ \ y y > 

t=i " X J t=i X i _ X i V A j+i A n 

and Z'j is analogous to Zj (the eigenvectors of A' are the same as those of A but their 
ordering is different). 

Let v = \\ a — A|, £ = \J—0j > (recall that with our conventions both A^ < 

and ff. < 0), and 8 = A^ - % = -v 2 + £ 2 = (£ - z/)(£ + v) with < 8 < e' A . Then 

4; -to since 8>Q^i-v>Q^£ + v=(^-v) + 2v>2v. Hence 

we obtain the following bound on the error in the Ritz value 9 a = A ± \J—0j of A 

corresponding to the eigenvalue X a of A obtained from the Ritz value 9'j of A' in the 
Krylov space /C„(A',v): 



(5.2) |A a -0 o |=min 



= £-i/< zA = 



2v 2|A a -A| 

Furthermore, since /C„(A',v) C fori (A, v) C 1* the MaxMin inequalities 

max min p(u, A) < max min p(u, A) < max min p(u, A) 

S Q C/C„(A' : v) ueS Q S Q C/C 2 „(A : v) ueS Q S Q CK« u6 S Q 



imply that 9 a < 9 a < X a . This means that the bound (|5.2[) on the error of the Ritz 
value # Q from the Krylov space K, n (A' , v) bounds that of the Ritz value 9 a from the 
Krylov space (A, v), A Q — 9 a > X a — 9 a > 0. 



5.3. Discussion. The advantage of the bound given by equations (|5.ip and ()5.2|) 
is that although the exponents pi will in general be smaller than those from the original 
Kaniel-Paige-Saad bound the prefactor K' may be far smaller in the interior of the 
spectrum. 
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Furthermore, p! may not be much smaller than fi for eigenvalues in low density 
parts of the spectrum for a good choice of the parameter A. For example, suppose 
that A Q is is just below a large void in the spectrum of width A, but not necessarily 
immediately adjacent to it, and is also close to X a+ \. We can choose A near the 
middle of void such that A^ +1 = — (X a +i — A) 2 . For notational simplicity let us also 
assume that A^ = — (Ajv — A) 2 , so 

, 2 _ _ -(Aa -A) 2 + (A Q+1 -A) 2 _ (A a - A a+1 )(2A-A a -A a+1 ) 

Mi 



X' j+1 - X' N -(Aa+i - A) 2 + (X N - A) 2 (A a+ i - \ N ) (2A - A Q+1 - X N ) 
Now, 2A - (A Q + A a+ i) w A, hence 



2A — X a — X a+ i 



^ ~ N V 2A - A Q+1 — Atv ~ W V 2A - (A a+1 + X N ) ~ ^ V Xi X N 

which is only slightly smaller than if A is a significant fraction of the spectrum. 

The limit on how far A can be from A Q and A Q +i is that when it gets closer to a 
dense band of eigenvalues the opposite side of the void the ordering of the eigenvalues 
of A' will change and X'j +1 < — (A a+ i — A) 2 . A few eigenvalues within the void do not 
matter, as they can be "removed" by a suitable zero in the polynomial p' , at the cost 
of a smaller K' factor and a slightly smaller factor n — j in the exponent. 

Amusingly, for (near) extremal eigenvalues /U = p! as we may take A — > ±00. 

Nevertheless, there is always an extra factor of two in the exponent arising from 
the fact the Krylov space £2™ (A, v) is twice as large as /C„(A',v), so asymptotically 
for large n the original bound is always better. This is not too important for two 
reasons: firstly because we are usually interested in getting approximations within 
some specified error rather than in the limit where the error tends to zero, and secondly 
since the Ritz values are exactly equal to the eigenvalues for n — N in exact arithmetic 
(or for smaller values of n if the spectrum of A is degenerate) there is a natural upper 
limit on the dimension n of the Krylov space. 

6. Examples. 

6.1. Qualitative Example. As our first example, consider anJVxJV matrix A 
whose spectrum lies in the unit interval. This matrix has k bands, each containing 
N/k eigenvalues, and each band is separated by a "void" of size A. Within each band 
the eigenvalues are separated uniformly by 6 = [1 — (k — 1)A]/N. 

If A = then all the eigenvalues are uniformly spaced, so fij cx VS = 1/y/N for 
any Xj, and the Kaniel-Paige-Saad bound shows that to resolve the eigenvalues we 
require a Krylov space of dimension n oc v^V- For eigenvalues in the interior of the 
spectrum the Kj factors become large, of course. 

If S = then A has k eigenvalues each with a degeneracy of N/k, and there are 
only k Ritz pairs. If S -C 1 then the error in the Ritz values \6— A|«<5forn«fc, but 
we require n oc l/VS in order to resolve the eigenvalues within each band (at least 
until the Ritz values become exact for n = N). 

If A ^> S then the exponent \j! w y/S for an interior eigenvalue A close to a void 
for shift A w A ± A/fc, so to resolve it we also need n cx v^V- This is the same scaling 
behaviour as for the usual bound, but the key difference between our bound and the 
usual one is that K' <C K. As A — >• 6 the scaling behaviour crosses over to n oc N 
since fi' w [i 2 . Perhaps this should not be too surprising as the Ritz values become 
exact for n — N. 




Fig. 6.1. Convergence of Ritz values for the matrix A whose spectrum is shown on the right. The 
plot shows the logarithm of the magnitude of the difference between each Ritz value and the nearest 
eigenvalue of A as a function of the dimension n of the Krylov space. The initial vector for the 
Krylov spaces was taken to have exactly equal overlap with each eigenvector. The black "bands" in 
the spectrum are actually equally-spaced eigenvalues with a spacing of 0.05. The smallest eigenvalue 
is A4g = —13.2. The squares show the errors for the actual Ritz values, and their colour indicates 
the nearest eigenvalue. The solid lines are the bounds from the shifted and squared matrix A' with 
shifts A chosen to give the smallest bound for an error of 10 -8 , while the dashed lines are the 
original Kaniel-P aige-Saad bounds. The shifts used were A — ¥ oo for Ai and A = 0.45 for A23, A24, 
and A25 . The solid line for A24 lies under that for A25 .' this is because they are equidistant from the 
shift value. The Ritz values coincide with the eigenvalues for n = dim A = 46. 



6.2. Quantitative Example. Our second example is more quantitative. We 
consider a matrix with two bands of closely-spaced eigenvalues and a few other eigen- 
values well-separated from these bands. In order to determine the value of the shift 
A we need to specify the value of n — the dimension of the Krylov space IC n (A, v) — 
at which the bounds are to be evaluated. We do this implicitly by choosing the value 
of A for which a bound of some specified magnitude, in this case 10~ 8 , is achieved 
for the smallest n. We compute an approximation to such a A by minimizing the 
exponent //. To do this we note that since d/i' /dA ^ anywhere yl can only have 
an extremum as a function of A when two eigenvalue of A' become degenerate, as at 
such points the correspondence between the eigenvalues of A and those of A' changes 
so as to maintain the correct order of the A'. We stress that these approximations 
merely mean that we may not have chosen quite the optimal value for A, nevertheless 
the bounds obtained are still valid since they hold for any A. 

The convergence of the Ritz values is illustrated in Figure RTT1 This clearly shows 
that our new bounds (solid lines) are significantly tighter than the usual Kanicl Paige 
Saad bounds (dashed lines) for the well-separated interior eigenvalues, although as 
expected the original bounds are better for the extremal eigenvalues. 
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This example illustrates some other interesting properties of the bounds. Observe 
that the lowest eigenvalue in this example, A46 = —13.2, is almost degenerate with the 
second-lowest eigenvalue, A45 = —13.1. This means that its bounds correctly indicate 
that it will only converge slowly. Interestingly the bound for latter is better than that 
for the former because this small gap appears in the coefficient K2 rather than the 
exponent fi\ of the extremal bounds for —A. 

If we did not know the spectrum beforehand, but tried to determine the bounds 
from the Ritz values from /C n (A,v) for some large value of n, we might be misled 
into thinking that the lowest eigenvalue was exactly degenerate, and thus obtain a 
much tighter bound on its convergence, and this bound would seem to agree with the 
empirical convergence of the Ritz value. Only when n becomes sufficiently large to 
resolve the splitting is this bound violated, there being a "plateau" in the error for 
this Ritz value until the Krylov space is able to separate the actual eigenpairs. The 
fact that the eigenvalue is nearly degenerate may be determined even for small n by 
"restarting", that is constructing the Krylov space /C n (A, v') for some different initial 
vector v', but this is insufficient to distinguish between exact and near degeneracies. 

It is often the case in applications that we are interested in finding eigenvalues 
and their eigenspaces up to some specified accuracy: for example, the matrix may be 
derived from a model that is only valid up to this accuracy. In such cases the bounds 
determined from the approximate spectrum may be more useful than the "exact" 
bounds, as might the "exact" bounds obtained from the approximate spectrum in 
which nearly degenerate eigenvalues are collapsed into exactly degenerate ones. This 
situation is common to both our shifted and squared bounds and to the original 
"extremal" bounds. 

7. Acknowledgements. We gratefully acknowledge the support of the Centre 
for Numerical Algorithms and Intelligent Software (EPSRC EP/G036136/1) in the 
preparation of this work. 

Appendix A. Chebyshev Polynomials. 

Chebyshev polynomials of the first kind may be defined as T n (x) = cos(n cos -1 a;), 
so it is immediately obvious that |T n (x)| < 1 for \x\ < 1. It is perhaps surprising that 
it is a polynomial in x, but application of the binomial theorem shows that 

w = g E ( T ) ( i ) i-vw™, 

so it is indeed a polynomial of degree n. Moreover it is also easy to see that 

^n\n(^x-\-\/x 2 — l) _|_ ^nh\{x— *Jx 2 — XJ 

we thus deduce that T n {x) > ^"M^V^T) for x > L 
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